Countering Adversarial Images using Input Transformations

نویسندگان

Chuan Guo

Mayank Rana

Moustapha Cissé

Laurens van der Maaten

چکیده

This paper investigates strategies that defend against adversarial-example attacks on image-classification systems by transforming the inputs before feeding them to the system. Specifically, we study applying image transformations such as bit-depth reduction, JPEG compression, total variance minimization, and image quilting before feeding the image to a convolutional network classifier. Our experiments on ImageNet show that total variance minimization and image quilting are very effective defenses in practice, in particular, when the network is trained on transformed images. The strength of those defenses lies in their non-differentiable nature and their inherent randomness, which makes it difficult for an adversary to circumvent the defenses. Our best defense eliminates 60% of strong gray-box and 90% of strong black-box attacks by a variety of major attack methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Countering Adversarial Images

متن کامل

Countering Adversarial Images

متن کامل

Countering Adversarial Images

متن کامل

Improving Transferability of Adversarial Examples with Input Diversity

Though convolutional neural networks have achieved stateof-the-art performance on various vision tasks, they are extremely vulnerable to adversarial examples, which are obtained by adding humanimperceptible perturbations to the original images. Adversarial examples can thus be used as an useful tool to evaluate and select the most robust models in safety-critical applications. However, most of ...

متن کامل

Synthesizing Robust Adversarial Examples

Neural network-based classifiers parallel or exceed human-level accuracy on many common tasks and are used in practical systems. Yet, neural networks are susceptible to adversarial examples, carefully perturbed inputs that cause networks to misbehave in arbitrarily chosen ways. When generated with standard methods, these examples do not consistently fool a classifier in the physical world due t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1711.00117 شماره

صفحات -

تاریخ انتشار 2017

Countering Adversarial Images using Input Transformations

نویسندگان

چکیده

منابع مشابه

Countering Adversarial Images

Countering Adversarial Images

Countering Adversarial Images

Improving Transferability of Adversarial Examples with Input Diversity

Synthesizing Robust Adversarial Examples

عنوان ژورنال:

اشتراک گذاری